Maximally selected chi-square statistics and binary splits of nominal variables.
نویسنده
چکیده
We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or an ordinal X, but not when the best split is chosen from a nominal X. In this paper, we derive the exact distribution of the maximally selected chi-square statistic in this case using a combinatorial approach. Applications of the derived distribution to variable selection and hypothesis testing are discussed based on simulations. As an illustration, our method is applied to a birth data set.
منابع مشابه
Maximally selected chi-square statistics for ordinal variables.
The association between a binary variable Y and a variable X having an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 x 2 contingency table using the chi-square statistic. The distribution of the maximally selected chi-square statistic (i.e. the maximal chi-square statistic over all possib...
متن کاملMaximally selected chi-square statistics for at least ordinal scaled variables
The association between a binary variable Y and a variableX with an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 × 2 contingency table using the χ2 statistic. The distribution of the maximally selected χ2 statistic (i.e. the maximal χ2 statistic over all possible cutpoints) under the nul...
متن کاملMaximally selected chi-square statistics and non-monotonic associations: an exact approach based on two cutpoints
Binary outcomes that depend on an ordinal predictor in a non-monotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of a positive outcome is particularly high (or low). A chi-square test may then be performed to compar...
متن کاملMaximally selected Chi-squared statistics and non-monotonic associations: An exact approach based on two cutpoints
Binary outcomes that depend on an ordinal predictor in a non-monotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of a positive outcome is particularly high (or low). A Chi-squared test may then be performed to compa...
متن کاملسری آمار: تحلیل جداول توافقی 1 (آزمونهای کایدو)
Assessing of outcomes and risk factors in the form of qualitative variables is common in the most of medical studies and the research objectives are defined as the relationship between these variables. This paper introduces the concepts and basic and applied statistical tests to examine the relationship between these variables in these studies, including chi-square tests. Principles and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Biometrical journal. Biometrische Zeitschrift
دوره 48 5 شماره
صفحات -
تاریخ انتشار 2006